A General Class of No-Regret Learning Algorithms and Game-Theoretic Equilibria
نویسندگان
چکیده
A general class of no-regret learning algorithms, called no-Φ-regret learning algorithms, is defined which spans the spectrum from no-external-regret learning to no-internal-regret learning and beyond. The set Φ describes the set of strategies to which the play of a given learning algorithm is compared. A learning algorithm satisfies no-Φ-regret if no regret is experienced for playing as the algorithm prescribes, rather than playing according to any of the strategies that arise by transforming the algorithm’s play via the elements of Φ. We establish the existence of no-Φ-regret algorithms for the class of linear transformations. Analogously, a class of game-theoretic equilibria, called Φ-equilibria, is defined, and it is shown that the empirical distribution of play of no-Φ-regret algorithms converges to the set of Φ-equilibria. A necessary condition for convergence to this set is also derived. Perhaps surprisingly, for the class of linear transformations, the strongest form of no-Φ-regret learning is no-internal-regret learning. Hence, the tightest game-theoretic solution concept to which such no-Φ-regret algorithms converge is correlated equilibrium. In particular, Nash equilibrium is not a necessary outcome of learning via such no-Φ-regret algorithms.
منابع مشابه
No-Φ-Regret: A Connection between Computational Learning Theory and Game Theory
This paper explores a fundamental connection between computational learning theory and game theory through a property we call no-Φ-regret. Given a set of transformations Φ (i.e., mappings from actions to actions), a learning algorithm is said to exhibit no Φ-regret if an agent experiences no regret for playing the actions the algorithm prescribes, rather than playing the transformed actions pre...
متن کاملLimits and Limitations of No-Regret Learning in Games
We study the limit behavior and performance of no-regret dynamics in general game theoretic settings. We design protocols that achieve both good regret and equilibration guarantees in general games. In terms of arbitrary no-regret dynamics we establish a strong equivalence between them and coarse correlated equilibria. We examine structured game settings where stronger properties can be establi...
متن کاملOn the convergence of no-regret learning in selfish routing
We study the repeated, non-atomic routing game, in which selfish players make a sequence of routing decisions. We consider a model in which players use regret-minimizing algorithms as the learning mechanism, and study the resulting dynamics. We are concerned in particular with the convergence to the set of Nash equilibria of the routing game. No-regret learning algorithms are known to guarantee...
متن کامل4 Learning , Regret minimization , and Equilibria
Many situations involve repeatedly making decisions in an uncertain environment: for instance, deciding what route to drive to work each day, or repeated play of a game against an opponent with an unknown strategy. In this chapter we describe learning algorithms with strong guarantees for settings of this type, along with connections to game-theoretic equilibria when all players in a system are...
متن کاملLearning Nash Equilibria in Congestion Games
We study the repeated congestion game, in which multiple populations of players share resources, and make, at each iteration, a decentralized decision on which resources to utilize. We investigate the following question: given a model of how individual players update their strategies, does the resulting dynamics of strategy profiles converge to the set of Nash equilibria of the one-shot game? W...
متن کامل